3574 results found.
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
CC-BY-SA
Size:
1005 entries Production Status:
Newly created-in progress
Use:
Quality Estimation
-
Paper title:Phrase Level Segmentation and Labelling of Machine Translation Errors
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Frédéric Blain | University of Sheffield | GB |
| Author 2 | Varvara Logacheva | University of Sheffield | GB |
| Author 3 | Lucia Specia | University of Sheffield | GB |
| Main Contact | Frédéric Blain | University of Sheffield | None |
Documentation:
<Not Specified>
Written
Lexicon,
Language Type:
Multilingual
Languages:
English Portuguese
Availability:
Freely Available
License:
CC
Size:
40MB <Not Specified>Production Status:
Existing-updated
Use:
Information Extraction, Information Retrieval
-
Paper title:Semantic Links for Portuguese
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Fabricio Chalub | IBM Research | BR |
| Author 2 | Livy Real | IBM Research | BR |
| Author 3 | Alexandre Rademaker | IBM Research and EMAp/FGV | BR |
| Author 4 | Valeria de Paiva | Nuance Communications | US |
| Main Contact | Alexandre Rademaker | IBM Research and EMAp/FGV | None |
Documentation:
<Not Specified>
Written
Evaluation Package,
Language Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
<Not Specified>
Size:
<Not Specified> <Not Specified>Production Status:
Existing-used
Use:
Evaluation/Validation
-
Paper title:Sentence Similarity based on Dependency Tree Kernels for Multi-document Summarization
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Şaziye Betül Özateş | Bogazici University | TR |
| Author 2 | Arzucan Özgür | Bogazici University | TR |
| Author 3 | Dragomir Radev | University of Michigan | US |
| Main Contact | Şaziye Betül Özateş | Bogazici University | None |
Documentation:
no documentationLanguage Type:
Trilingual
Languages:
English German french
Availability:
Not Available
License:
<Not Specified>
Size:
297h <Not Specified>Production Status:
Newly created-in progress
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:The KIT Lecture Corpus for Speech Translation
-
Paper track:Speech
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Sebastian Stüker | <Not Specified> | None |
| Author 2 | Florian Kraft | <Not Specified> | None |
| Author 3 | Christian Mohr | <Not Specified> | None |
| Author 4 | Teresa Herrmann | <Not Specified> | None |
| Author 5 | Eunah Cho | <Not Specified> | None |
| Author 6 | Alex Waibel | <Not Specified> | None |
| Main Contact | Sebastian Stüker | Karlsruhe Institute of Technology | DE |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
CC BY-NC-SA 3.0
Size:
100000 tweets OtherProduction Status:
Newly created-finished
Use:
Sarcasm Detection
-
Paper title:Sarcasm Detection on Czech and English Twitter
-
Paper track:Sentiment Analysis, Opinion Mining and Social Media
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Tomáš Hercig | Queen's University Belfast, University of West Bohemia | CZ |
| Author 2 | Ivan Habernal | University of West Bohemia | None |
| Author 3 | Jun Hong | Queen's University Belfast | None |
| Main Contact | Tomáš Hercig | University of West Bohemia | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
<Not Specified>
Size:
1.1 MByte Production Status:
Newly created-finished
Use:
Summarisation
-
Paper title:LQVSumm: A Corpus of Linguistic Quality Violations in Multi-Document Summarization
-
Paper track:Evaluation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Annemarie Friedrich | Saarland University | DE |
| Author 2 | Marina Valeeva | Saarland University | DE |
| Author 3 | Alexis Palmer | Saarland University | US |
| Main Contact | Annemarie Friedrich | Ludwig-Maximilians-Universität München | None |
Documentation:
README in English
Written
Lexicon,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
CC-By-SA 3.0
Size:
5000 entries Production Status:
Newly created-finished
Use:
Opinion Mining/Sentiment Analysis
-
Paper title:SLIDE - a Sentiment Lexicon of Common Idioms
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Charles Jochim | IBM Research Ireland | IE |
| Author 2 | Francesca Bonin | IBM Research Ireland | IE |
| Author 3 | Roy Bar-Haim | IBM Research AI - Haifa | IL |
| Author 4 | Noam Slonim | IBM Research AI | IL |
| Main Contact | Charles Jochim | IBM Research | None |
Documentation:
<Not Specified>
Written
Evaluation Data,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
20982 entries Production Status:
Existing-used
Use:
Evaluation/Validation
-
Paper title:Segmenting Hashtags using Automatically Created Training Data
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Arda Celebi | Bogazici University | TR |
| Author 2 | Arzucan Özgür | Bogazici University | TR |
| Main Contact | Arda Celebi | Bogazici University | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
Croatian English
Availability:
Freely Available
License:
<Not Specified>
Size:
9387 entries Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Comparing two acquisition systems for automatically building an English–Croatian parallel corpus from multilingual websites
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Miquel Esplà-Gomis | Universitat d'Alacant | ES |
| Author 2 | Filip Klubička | University of Zagreb | HR |
| Author 3 | Nikola Ljubešić | University of Zagreb | SI |
| Author 4 | Sergio Ortiz-Rojas | Prompsit Language Engenering | ES |
| Author 5 | Vassilis Papavassiliou | Institute for Language and Speech Processing / RC Athens | GR |
| Author 6 | Prokopis Prokopidis | Institute for Language and Speech Processing/Athena RC | GR |
| Main Contact | Miquel Esplà-Gomis | Universitat d'Alacant | None |
Documentation:
Documentation in English is publicly available at http://redmine.abumatran.eu/projects/en-hr-tourism-corpus/documentsLanguage Type:
Multilingual
Languages:
English Standard Arabic
Availability:
for participants in the evaluation campaign
License:
<Not Specified>
Size:
20000 <Not Specified>Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Arabic-Segmentation Combination Strategies for Statistical Machine Translation
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||||
|---|---|---|---|---|---|---|---|
| Author 1 | Saab Mansour | RWTH Aachen University | None | ||||
| Author 2 | Hermann Ney | RWTH Aachen University | DE | RWTH Aachen University | None | RWTH Aachen | DE |
| Main Contact | Saab Mansour | RWTH Aachen University | DE |
Documentation:
<Not Specified>




